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^> Abstract 

CN \ For a mixing and uniformly expanding interval map / : I — >■ I we pose the 

,__,■ following questions. For which permutation transformations a : I — > I is the 

composition a o / again mixing? When a o / is mixing, how does the mixing rate 

i—\ \ of a o / typically compare with that of /? 

r~| ' As a case study, we focus on the family of maps f(x) = mimodl for 

2 < m € N. We split [0, 1) into N equal subintervals, and take a to be a 
permutation of these. We analyse those a G Sn for which a o / is mixing, and 
show that, for large A, typical permutations will preserve the mixing property. 

^i ■ In contrast to the situation for continuous time diffusive systems, we find that 

£> , composition with a permutation cannot improve the mixing rate, but may make 

it worse. We obtain a precise result on the worst mixing rate which can occur as 

,__i ■ a varies, with m, N fixed and gcd(m, N) = 1. 

in : 

rn ! 

O 1 Introduction 

(N ; 

We consider a mixing dynamical system (/,/,//), where / : I — > I is an ergodic /x- 
measure preserving transformation on a compact interval /. If we divide the interval / 
up into N equal subintervals, we also consider a permutation transformation a : I — > I 
formed by permuting these N subintervals. The question we consider is the following: 
is the composition a o / still mixing for general permutations a? If mixing is preserved 
then does the mixing rate of co/, typically increase (or decrease) relative to applying / 
alone? The physical motivation for studying this kind of problem arises through studies 
of mixing rates for diffusive systems, see [2|. In this work, it is shown that the rate of 
mixing in a one dimensional, continuous time diffusive system can be accelerated by 
applying a permutation transformation at regular time steps t G N. Thus we consider 
this question when diffusive systems are replaced by uniformly expanding maps, which 
in themselves are regularly considered as canonical examples of mixing protocols [H] . 
A more general problem would be to consider mixing (and non-uniformly 
expanding) systems / : M — y M, with M a compact set, and compose / with a 



piecewise isometric map T : M — >■ M . We can then ask if To / is typically mixing, and 
moreover, when T o / is mixing does the mixing rate (typically) increase or decrease? 
Typical here, would mean that if / or T belong to some (parametrized) family of 
maps, then mixing is preserved for a generic (or full measure) set of parameters. 
Ergodic and topological properties of these kind of transformations (e.g. maps of 
the form x i— > (3x + a mod 1, < a < 1 < /3) have been studied, see for example 
0, [TTJ . Permutation groups also feature more generally in the study of combinatorial 
dynamical systems [JJ , and in the study of topological entropy [I] . 

In this article we investigate the family of maps f(x) = mx mod 1, for 2 < m 6 N, 
x G [0,1] and take a to be a permutation transformation on [0,1]. Thus we regard 
a as an element of the symmetric group Sn, which consists of all permutations of iV 
symbols and has order J}5jv = N\. Individually, / and a are well understood in terms 
of their dynamical properties, but their composition a o f is not. It therefore becomes 
an interesting group theoretic and combinatorial problem to determine when a o / 
is mixing. Thus the first problem we consider is how to classify those permutations 
a G Sn for which a o f is mixing (or not mixing). We will show that if N is not a 
multiple of m then a o f is mixing for all permutations a G Sn- This will be the focus 
of Theorem [TJ In contrast, when iV is a multiple of m, we will show that there exist 
permutations in Sn for which a o f is not mixing, but asymptotically (as N — > oo) the 
proportion of such permutations tends to zero. This will be the focus of Theorem [2] 
and Theorem [3j 

In the case where a o f is mixing, we then calculate (or estimate) the rate of mixing 
using spectral methods, see [31 131 HEJ ED]- We will give an overview of the relevant 
theory in Section [31 In particular, we will work with the Perron- Frobenius operator 
Ccrof restricted to the space of functions of bounded variation (BV) and calculate its 
eigenvalues. This operator is not compact on BV, but it does have a spectral gap. We 
will show that the essential spectral radius is precisely 1/m, and use a Fredholm matrix 
approach to calculate the remaining (isolated) eigenvalues. The modulus of the largest 
of these eigenvalues in the interior of the unit circle gives the (exponetial) mixing rate 
of a o f. We analyse how these isolated eigenvalues vary with the permutation a. This 
can be done directly, but the complexity of the calculation increases rapidly as m, N 
grow. Thus we use linear algebra and combinatorial methods to find/estimate the 
largest eigenvalues of C ao f (with modulus less than one). In particular, while / itself 
has mixing rate 1/m, we will show in Theorem [U that if gcd(iV, m) = 1 and N > m, 
then the worst mixing rate for a o f as a varies is | sin(7rm/iV)/(msin(7r/./V))|. This 
value can be made arbitrarily close to one by taking N large. The proof of Theorem H] 
involves an investigation of the eigenvalues of certain doubly stochastic matrices, and, 
in particular, the effect on the eigenvalues of permuting the columns of the matrix. 
Although the determination of the eigenvalues of various classes of stochastic matrices 
is an old problem which has been considered by a number of authors (see for instance 
[21 El [TOl 021 E2]), we give a fairly simple result in this direction (Lemma I8.4[) which 
seems to be new and may perhaps be of independent interest. 

While the above results are for the specific family of maps f(x) = mx mod 1, we 
also consider two further examples, where different behaviours occur. In Section [71 we 
give a piecewise linear Markov map / : [0, 1] — > [0, 1], such that a o f is not mixing 



for a positive asymptotic proportion of permutations a G Sn- The proof is purely 
combinatorial. We also give an example in Section IXB1 to show that, at least for certain 
non-uniformly expanding maps /, the composition with permutations can (typically) 
speed up the rate of mixing. This will happen if / has a well defined source that 
inhibits the mixing, such as a map with intermittent behaviour [91 [15]. Heuristically, 
the mixing rate of a o / will be faster than that of / if the permutation action removes 
the intermittent source. 

2 Mixing versus non-mixing 

In this section we state our results in relation to the question of mixing versus non- 
mixing of a o /. Given a measure preserving system (/, M, /i), we say that the system 
is (strongly) mixing if 

\fi(f- n AnB)- fi(A)fi(B)\ ^ as n ^ oo, (1) 

where A, B are /z-measurable sets. Another version of mixing is that of topological 
mixing, namely we say (/, M) is topologically mixing if for all open U, V C M, there 
exists a constant n (U, V) such that Vn> n , f n (U) fl V ^ 0. To show that / is not 
mixing, then it is usually easier to show that / is not topologically mixing. We will 
take this approach when exhibiting maps / and T such that / is mixing, but T o / is 
not mixing. For the examples that we consider, it will be also true that topological 
mixing implies strong mixing, see [20] and Section [3J 

To be more precise we consider certain piecewise expanding maps / on the unit 
interval. We divide the unit interval into equal subintervals, and compose / with 
permutations of these. To avoid the problem of functions being undefined, or multiply 
defined, at endpoints of these subintervals, we work with (non-compact) intervals which 
are closed on the left and open on the right. Thus we consider piecewise continuous 
maps /: [0, 1) — > [0, 1). We can of course regard / as a map on the compact interval 
[0, 1] (by stipulating /(l) = /(0)) or on the circle S 1 = E/Z. 

We divide the unit interval as follows. Fix N > 2, and let Ij = [j/N, (j + 1)/N), 
< j < N. For any permutation a of {0, 1, . . . , iV — 1} we write a also for the 
corresponding interval exchange map: 

a(x) = x + (cr(j) — j)/N mod 1 for x G Ij. 

We identify the indexing set {0, . . . , N — 1} with the the ring Z/iVZ of integers modulo 
N, so that arithmetic in this indexing set is to be interpreted as arithmetic modulo N, 
and we write Sn for the group of all permutations of Z/iVZ. 

The specific map /: [0, 1) — > [0, 1) we consider is f(x) = mx mod 1 for a fixed 
integer m > 2. Our goal is then to investigate those permutations a G Sn for which 
the composite piecewise linear function g — a o f on [0, 1) fails to be (topologically) 
mixing. 

When iV is not a multiple of m, the result is easy to state. 

Theorem 1 In the above situation, suppose that N is not a multiple of in. Then, for 
every a G Sn, the composite g — a o / is mixing. 



This leaves us with the case iV = ml for some integer £. In this case, we shall give 
a combinatorial characterisation of those a for which a o / is not mixing. In order to 
state the result, we first introduce some terminology. We consider partitions Z/NZ 
into disjoint non-empty sets: Z/iVZ = A\ U . . . U A t . We call the set B = {Ax, . . . , A t } 
of subsets of Z/NZ a block decomposition of Z/iVZ, and refer to the A{ as blocks. We 
say that B is trivial if £ = 1, and that B is ^-stable if, for any j G Z/NZ and 1 < r < i, 
we have j £ A r ^ j + I £ A r . Thus B is ^-stable if and only if each A r is a union of 
cosets of the subgroup IZ/NZ of Z/NZ. If B = {Ai, . . . , A t } is a block decomposition 
and a G SV, then <tB = {a(Ax), . . . ,a(A t )} is also a block decomposition, and we 
define the stabiliser Gb of B as 

G M = {a G S N : <r(!) = B}. 

Then G^ is a subgroup of £V 

Theorem 2 Let /(ar) = mi mod 1 and let N = ml. There is a permutation 5 G Sn 
such that the following holds: for any a G Sn, the composite g — a o / fails to be 
mixing if and only if there is some nontrivial t-stable block decomposition B of Z/NZ 
such that a5 G G®. 

We will use this combinatorial result to investigate the proportion 

_ jjjo- eS me : aofis not mixing } 
Pl ' j " (m£)! [) 

of permutations for which g = o~ o f fails to be mixing. (Here and subsequently, we 
write §X to denote the cardinality of a finite set X.) In Section [5] we will obtain explicit 
formulae for p(£, m) when I is small, while in Section [HI we prove the following upper 
bound: 



2e x m " ] 



Theorem 3 When N = ml with I > 6, we have 

p(l,m) < 11 fy 

In particular, for each fixed m >2 we have p(l, m) — > as I — > oo. 

This may be interpreted as saying that a o f is mixing for almost all permutations 
a when iV is large enough. Since a typically preserves the property of mixing, it is 
then natural to study the mixing rate of a o f relative to that of / alone. 

3 Rates of mixing under permutation action 

For the map f(x) = mx mod 1, m > 2, and for general a G Sn we consider the rate 
of mixing of the composed transformation oof. For specific a G Sn we will calculate 
explicit estimates on the rate of mixing, and show that permutations will either preserve 
the rate of mixing or slow the mixing rate down relative to the transformation Id o / 
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(where Id is the identity). For the latter case, we will show that for given m, there exist 
permutation transformations that give rise to arbitrarily slow (exponential) mixing 
rates. In the other direction, we also discuss briefly the case when / is a non-uniformly 
expanding map with a well defined source that gives rise to the non-uniform expansion. 
We will show in this case, that composing a permutation transformation with / will 
typically speed up the mixing rate. 

3.1 Background on transfer operators and Fredholm matrices 

Our methods for studying mixing rates will utilize the theory of transfer operators and 
Fredholm matrices, see [3l H31 [161 HH1 EO]- We now give an overview of the relevant 
theory. For a measure preserving system (/, M, /x), the rate of mixing can be quantified 
in various ways. However, we will primarily focus on the speed of convergence to 
equilibrium. More precisely, if / : [0, 1] — > [0, 1] is a piecewise expanding map, we 
define the Perron-Frobenius operator Cf : L 1 — > L 1 by: 

f(y)=% 
The operator Cf satisfies the following identity, for G L p , ip G L q (with p~ 1 + q~ 1 = 1): 

(Cf<f>)i/>dx= [<f>(i/>of)dx, (4) 



where dx denotes integration with respect to the reference (Lebesgue) measure. If / 
preserves an ergodic measure \i with density p{x) G L 1 , then (£fp)(x) = p(x). Suppose 
now that we have a Banach space B C L 1 , with p G B, and with norm || • ||g. We define 
the speed of convergence to equilibrium in B as the rate r{n) such that 3 Cg < oo, 

\\£ n f (<f>)-p\\ B <C B r(n), \/0eB,U\\ 1 = l,Vn>l, (5) 

and there exists <p G B with ||0||i = 1, such that (for sufficiently large n) and for some 

cv > o 

I|/:?(0)-pIIb><V(7O, v0gr (6) 

For the whole space L 1 , the rate function r(n) cannot be specified, i.e. there exist 
4> G L 1 for which ||£^(</>) — p\\i decays arbitrarily slowly. When / is a piecewise 
expanding map, the natural space to consider is B = BV, the class of functions with 
bounded variation. We recall this definition as follows, see [13J. Given a function 
: [0, 1] — > R, we define the total variation of as 

n 

var(0) = sup{^ \4>{x k ) - <j>{xk-\)\ ■ < x < . . . < x n = 1}, (7) 

fc=i 

where the sup is taken over all partitions of [0, 1]. We say that has bounded variation 
(i.e. G BV) if var(0) < oo. To make BV into a Banach space, we define the norm 

II ■ II bv by 

lev := ||0||i + var(0), 



and hence consider functions G BV with ||0||bv < °°- Bounds on the rate of 
mixing r(n) can then be obtained by analysing the spectral properties of the restriction 
£/|bv '■ BV — > BV. In particular we say that £/|bv has a spectral gap if 

r := sup {|A| : A G Spec(£ / | BV )\{1}} < 1, (8) 

where Spec(£/) is the spectrum of Cf. Hence for any G BV and any e > 0, the 
spectral decomposition of £/|bv implies that there exists C > such that for all n, 

\\C n f {<j>) - p||bv < C ■ (r + e) n ||0|| B v with H^ = 1. (9) 

Thus t determines the rate of convergence to equilibrium. In general, finding the exact 
value of r is not straightforward. We can, however consider the essential spectral radius, 
r ess (£f\Bv), and this is defined by 

r ess (£f| B v) : = inf{r > : A G Spec(£/|Bv)? |A| > r =>• A isolated}. 

In the above definition of r ess , the isolated eigenvalues A with |A| > r are also of finite 
multiplicity. For piecewise expanding maps, see [121 Theorem 1], we have the lower 
bound on r via 

r > r ess = exp <^ -liminf essinf^o,!] -log \(f') k (x)\ \ ■ (10) 

This lower bound will be strict if there are no isolated eigenvalues with modulus in 
(r ess , 1). To study upper bounds on the speed of convergence to equilibrium we will 
need to determine the location (or existence) of these isolated eigenvalues. We will use 
a Fredholm matrix approach for Markov dynamical systems, see [161 E] • A summary 
is as follows. 

Consider a piecewise linear Markov map /:/—>/, with finite partition V = {ii}f = i, 
and representative transition matrix B. Here B is a q x q matrix with B^ = 1 if 
Ij C f{Ii), and Bij = if f{Ii) H Ij = 0. We will assume that / is differentiable on the 
interior of each element of V. If Cf is the Perron-Frobenius operator, and J C /, we 
consider the power series defined on C x D, with DcC: 



s J 



z,x) =J2^q{Xj)(x) = Xj(x) + J2z n £ n f (Xj)(x), (ii) 



n=0 71=1 



where Xj(x) is the indicator function of J. When J = Jj G V, we will write s J (z, x) as 
s®(z,x). We let s(z,x) be the vector (s^'(z,x))1 =1 , and similarly X_(x) = {X{i)(x))1=\- 
For a Markov system we have the following result. 

Proposition 3.2 For a piecewise linear Markov map f : I — > I with finite partition 
V = {h} q i= i, there exists a q x q matrix $(z), and such that 

s(z, x) = (I- <$>{z))- l X{x). (12) 



The matrix $(z) in Proposition 13.21 is called a Fredholm matrix. 

Proof. We will consider the Markov case where the slope is constant on each 
Ii (but not constant globally). Our proof is a slight adaption of the calculations in 
[16], ITT]. In particular we will obtain an explicit form of $(2). First of all, by definition 
of Cf we have 

„=1 /»(„)=« u ; vy; 



If J = Ii E V , the following hold: 



«=i f n -Hf{v))=x 



^(o(y) 



*w(*) + *£j77rmi E E 



*(j)(£) 



fK/'l/OI^^Ut'ia- 1 )'^! 



f n - 1 (y)=x i,j 



X(i)(x) + 



{mvf^y^- 



Hence we obtain a q x q matrix $(2), with $(z)ij = {^/|(/' | ^)l}-^j> anc ^ 

s(z,x) = (/-$(z)) _1 ^(x). (13) 

This completes the proof. I 

Given the Fredholm matrix &(z), we define the Fredholm determinant to be the 
quantity D(z) = det(I — $(z)). For piecewise-linear expanding (Markov) systems, the 
Fredholm matrix and Fredholm determinant have the following properties, see [TBI E] : 

1. The number of ergodic components of / is equal to the dimension of the 
eigenspace of J — <&(1) associated to the eigenvalue of value zero. The number of 
ergodic components is also equal to the order of the zero at z = 1 in the equation 
det(J - $(z)) = 0. 

2. If zero is a simple eigenvalue of I — <&(1) then the system is ergodic. Moreover if 
{\z\ = 1} fl Spec(£/|Bv) = {1} then the system is mixing 

3. If A G C and |A| > r ess , then A G Spec(£/|Bv) if an d only if z — A -1 is a zero of 
D(z), i.e. D(l/X) = 0. 

4. If D(l/X) = then A is an eigenvalue of £/|bv- 

In addition, the (ergodic) invariant density p(x) can also be computed, and is given by 
the following formula: 

P (x) = J2 sr Tfi ^) (x) ' (14) 



where v = {vi) q i=1 is a left-eigenvector of $(1) associated to eigenvalue 1. i.e. 
Ylii v i^(X)i,j = v j- The proof of equation (|T3|) is quite straightforward, i.e. just check 
that (Cfp)(x) = p(x). Thus, the Fredholm matrix <&(z) at z = 1 can viewed as the 
dual operator to Cf (relative to the vector space generated by the indicator functions 
on the Markov partition V). 

Connection with decay of correlations. 

Instead of considering the speed of convergence to equilibrium a related quantity is 
that of decay of correlations [31 [161 [20]. Given a Banach space B, we say that (/, M, p) 
has decay of correlations in B with rate function r(n) if there exists some Cg such that 

V0,^eS with ||0|| = ||^|| = 1, 



C n ((j),ip,p) :-- 



4>{ip o f n ) dp — (f) dp ip dp 



< C B r{n). (15) 



When ip e L°° and / G BV, then we have 

c n (<P^,p) < u P \u\£ n f(M - plli < \\M\oo\\q(<Pp) - p\\bv (16) 

where in the last step we use || ■ ||i < || • ||bv- Hence the rate of decay of correlations 
depends on the eigenvalues of £/|bv- i- e - we have that Ve > 0: 

lim(|A| + e)-"C(0,^,/i) = O, X' 1 = ini{z : z' 1 e Spec(C f \ BY )\{l}}. (17) 

Connection with dynamical zeta functions 

It can be further shown that the Fredholm determinant D(z) (for a piecewise linear, 
(Markov) expanding system) is related to the dynamical zeta function ((z) via: 

1 , w s v^ z n v-^ 1 



C(z) = T^a where ' C(2;) = exp { H TT H 



D(z) w r p" ^ (f n )'(p) 

I n=l p:/«(p)=p W ; yFI 

see [31 [16] for an overview. Thus, within the region \z\ > r ess , ((z) is meromorphic with 
singularities (poles) at the zeros of D(z). In particular ((z) is analytic in the region 

1^1 ^ ' ess' 

To see the relationship between ((z) and D(z), the Fredholm determinant can be 
calculated as: 

det{(/ - ^(z))- 1 } = exp {-tr (log(J - $(«)))} = exp i ]T -tr($(^) n ) I , 

where tr is the trace operation. If, for example &(z) = —B, where f'\Ii = m for all 
IeV, then 



det(7 -•(,))-= exp {Ef ^} 



and tr(5 ra ) is precisely the number of fixed points of f r 



3.3 Statement of results 

In our setting we consider specifically the map f(x) = mx mod 1. When / is composed 
with a permutation a £ Sn, the resulting piecewise linear transformation may or may 
not be mixing, as discussed in Section [2j Here, we focus on those transformations that 
are mixing, but we consider now the rate of mixing. We state the following result. 

Theorem 4 Fix m, N > 2 and consider the transformations a o f where f(x) = mx 
mod 1 and o £ Sn- Then the following hold. 

1. For all a £ Sn, the essential spectral radius is given by r ess (£ cro /|Bv) = 1/m- 

2. When gcd(m, N) = 1, we have 

sm(7rm/N) 



max{ |A| : A £ Spec(£ CT o/|Bv), |A| =£ 1, a £ Sn} 



msm(ir/N) 
This value therefore gives the worst mixing rate as a varies over Sn- 



(19) 



Remark 3.4 Notice that for fixed m and taking N sufficiently large, Theorem H] 
implies the existence of eigenvalues of £ cto /|bv arbitrarily close to one: for the maximal 
eigenvalue A in equation f[T9~j) . we have |A| ~ 1 — 7r 2 (m 2 — l)/(6iV 2 ) + o(l/iV 2 ). 

Thus, the results obtained here are contrary to the results obtained when 
considering permutation actions applied to diffusive systems, as described in [2]. In 
that setting, the mixing rate is always observed to increase when permutation actions 
are applied at fixed time steps, while in our setting, the permutation actions will either 
preserve the mixing rate or slow it down. 

The proof of Theorem m in particular Statement 2, is given in Section [HJ The 
computation of r ess is straightforward since a o / is piecewise linear with constant 
slope 1/m. Hence Statement 1 is a consequence of [13J. To find upper estimates on the 
mixing rates it suffices to compute or estimate the eigenvalues of the Fredholm matrix 
associated to ao f for a particular permutation a. This will utilize techniques in linear 
algebra and the combinatorics associated to permutation matrices. When f(x) = mx 
mod 1, and a is the identity permutation, an elementary calculation implies that the 
Fredholm matrix $(1) has eigenvalues equal to zero (multiplicity m— 1) and one. Here 
we take the natural Markov partition of a o f by dividing up the unit interval into m 
equal subintervals. 

Let us now consider some explicit examples and make some further observations, 
some of which are based on numerical computations. For each a £ Sn, we are interested 
in the quantity 

A CT = max{ |A| : A £ Spec(£ CTo/ | B v), |A| ^ 1}. 

This gives the mixing rate for cro/, so that in particular, composition with a decelerates 
the mixing if and only if A a > 1/m. For small values of m and N, Table [T] summarizes 
the number of permutations in Sn which this occurs. This suggests that as we increase 
N (with m fixed), we should expect a higher proportion of permutations in Sn to 
decelerate the mixing. We therefore pose the following question: 

9 



SN,mx mod 1 


m = 2 


m = 3 


m = 4 


N = 3 

N = 4 
iV = 6 

iV = 8 








16890 








35152 





144 
18432 



Table 1: The number of permutations in Sn that lead to slower mixing than Id o /. 



Sn, mimodl 


m = 2 


m = 3 


N = 8 


0.416 


0.862 


AT = 30 


0.958 


1.000 


N = 40 


0.978 


1.000 


AT = 50 


0.984 


1.000 



Table 2: The proportion of permutations which decelerate the mixing when taking 
large values of N. For each N, a random sample of 1000 permutations were chosen. 

Question 3.5 Is it the case that 

Ho" eS N :A„> 1/m} _ 1 ? 



JV->oo 



N\ 



To investigate this further, we fixed values m = 2, 3 and chose 1000 permutations at 
random from Sn for A^ large. We then examined the proportions of these permutations 
for which |A CT (m)| > 1/m. Table [2] summarizes the results. 

More generally, it would be interesting to know the distribution of the A^ as a 
varies (with A^ large). As in [5], one might expect this to be well approximated by a 
generalised extreme value distribution with parameters depending on N. 



3.6 Speeding up mixing for non-uniformly expanding systems 

Although we have shown that permutations cannot speed up the mixing rates for 
certain uniformly expanding systems, we will demonstrate here that permutations can 
speed up the rate of mixing if the original (un-permuted) system has a well defined 
source that inhibits mixing. Consider the following family of intermittency maps 
/ Q :[0,l]->[0,1], a G (0, 1) given by 



fa{x) 




ifxG [0,1/2], 
ifxe (1/2,1]. 



(20) 



This family has been widely studied [TSJ EI] and optimal decay of correlations/speed of 
convergence to equilibrium has been established in [S]. In particular, it is shown that 
there exists a Banach space £>, such that 



\C n f (<f>) - p\\ B < Cu-^MIb, 



(21) 
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for all G B with ||0||i = 1. Moreover this asymptotic in n is optimal within B. The 
sub-exponential mixing rate arises since each f a admits a neutral fixed point at x = 0, 
namely /'(0) = 1. The existence of the neutral fixed point inhibits the mixing. In 
particular, the functional analytic methods discussed in Section 13.11 do not apply since 
{A = 1} is no longer an isolated eigenvalue of £/. i.e. there is no spectral gap. 

If we consider f a composed with a permutation a G Sn, then typical permutations 
will destroy the neutral fixed point, i.e. o o f a will be uniformly expanding if a does 
not leave invariant the interval [0, 1/iV]. We state the following. 



Theorem 5 Suppose f a is given by equation l20\) . and a G Sn is a permutation 
transformation on [0, 1] that does not leave [0, 1/N] invariant. Suppose that a o f 
is topologically mixing. Then, in B = BV , we have an exponential rate of convergence 
to equilibrium: 3C > 0, rj G (0, 1), such that, for all probability distributions <fi G B, 

||£? o/ (0)-/>||Bv<C'7f-|Mls, (22) 

Proof. Since a does not preserve [0, 1/iV], the map crof is uniformly expanding on 
[0, 1]. In general the map is not Markov (with respect to a finite partition), but it does 
have bounded variation on [0, 1]. Thus if a o f is topologically mixing, then it follows 
from |13[ EU] that a o / has absolutely continuous invariant measure, with density in 
BV. Since the system is uniformly expanding, the operator C ao f now has a spectral 
gap. Thus the rate of convergence to equilibrium is now exponentially fast. I 



Remark 3.7 Since a o f a is no longer Markov, a complete analysis of which 
permutations preserve topological mixing is a much harder problem, which we do not 
pursue. 

4 When is a o f non-mixing? 

In this section, we prove Theorems [1] and [2j Recall that we are considering the 
composite g = a o f where f(x) = mx mod 1 and a G Sn, and that we partition 
the unit interval into subintervals I a = [a/N, (a + l)/N) for a G {0, 1, . . . , N — 1}, 
where this indexing set is identified with the ring Z/iVZ of integers modulo N. 
To begin with, we allow arbitrary m, N > 2. 

Definition 4.1 For any subset A C 'L/N'L, we define 

m—l 

f(A) = |J (mA + d) C Z/iVZ, g(A) = a(f(A)). 

d=0 

Proposition 4.2 For each A C Z/iVZ, we have 

\a£A / bef(A) \aeA / b&g(A) 
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Proof. This is immediate since for each j G Z/NZ we have 

ra— 1 
d=0 



Proposition 4.3 For each A C Z/NZ, we have 

%A < if (A) < raftA 

Moreover, suppose that < %A < N. Then we have §f(A) = §A if and only if the 
following two conditions hold: 

(i) N = mi for some integer £; 

(ii) A is a union of cosets of (."L/N'L (that is, j^A^j+£eA for all j G Z/NZ). 

Proof. If b G f(A) then b = ma + d (mod N) for at least one of the raftA pairs (a, d) 
with a G A and < d < m. Hence $f{A) < raft A Now fix one pair (ao, ofo)- If another 
pair (a, rf) gives the same element b then 

ma + d = ma + d Q (mod AT). (23) 

Thus d = do (mod s), where s = gcd(m,N). This gives m/s possibilities for d. For 
each of these, (|2"3"|) has s solutions a in L/NL, all congruent mod A^/s (but in general 
not all in A). So each b arises from at most ra of the pairs (a, d), giving ft/ (A) > ft A 
This proves the first assertion. 

If if {A) = ftA / then each b must arise from exactly ra pairs (a,rf). Thus given 
ao G A we ma Y take d = do = 0, and the s solutions a to (|23l) must all lie in A This 
shows that a + N/s G A, so that A is stable under addition of N/s. 

First suppose (i) holds. Then N/s = £, so that if ft/ (A) = ft A then (ii) holds. 
Conversely, if (i) and (ii) hold, then each b G f{A) arises from ra pairs (a +j£, d) with 
< j < ra, so that ft/(A) = ftA 

It remains to show that if (i) does not hold and ft/(A) = %A > then A = 1,/NL. 
So let ra = es with e > 1, and let ao G A Since s < ra, we may take do = s in (1231) . 
But (|2"3"|) must have s solutions for each of the possible values d = d (mod s) with 
< d < ra, so we can find ax E A with raai = raa + s (mod AT). Then ea\ = ea + 1 
(mod AT/s). Iterating, we can find a,- G A with eaj = eaj_i + 1 = ea + j (mod AT/s) 
for j > 1. As gcd(e, N/s) = 1, we have a e = ao + 1 (mod N/s). Since we already know 
that A is stable under addition of N/s, it follows that A is stable under addition of 1, 
so that A = Z/NZ. I 

Corollary 4.4 For any A C Z/NZ, we have 

MA) > ftA 

Moreover, if A is a proper subset of Z/NZ then equality can only occur if N = £m for 
some integer £. 
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Proof. This is clear since §g{A) = §f(A). I 

Lemma 4.5 g fails to be (topologically) mixing if and only if there is some proper 
subset A ofZ/NZ such that $g r (A) = %A for all r > 0. 

Proof. Let A be a subset with < %A < N and §g r (A) = %A for all r. As there are only 
finitely many subsets of Z/iVZ, we may choose s > and t > 1 with g s+t (A) = g s (A). 
Set B = g s (A) and take nonempty open sets U C Ij and V C Ik where j G B and 
k <£ B. Then for all n > we have a nt (C7) C U fe6B I b so that ^"'(f/) n V = 0. Hence 
is not mixing. 

Conversely, suppose there is no proper subset A with §g r (A) = §A for all r. To 
see that g is mixing, we show that for any nonempty open subset U of [0, 1) we have 
g n (U) = [0, 1) for large enough n. Without loss of generality, U is an interval of length 
5 > 0. Since m > 1, we can choose h large enough that g h {U) contains the initial point 
j/N of some interval Ij. Then for some e > 0, we have [j/N,j/N + e) C g h (U) fl Ij. 
Choose k so that m k e > 1/N and let g k (j/N) = f/N. Then I jt C g k (Ij) C g h+k (U). 
Now let _B = {j'} and take s > 0, t > 1 with g s+t (B) = g s (B). The nonempty set 
A = g s (B) then satisfies the condition g nt (A) = A for all t > 0. Hence, by Corollary 
I4.4} we have tl<? r (^4) — tt-4 for all r > 0. Thus our hypothesis forces A = Z/iVZ, so that 
gi(I f ) = [0, 1) for all q > s. It follows that g n {U) = [0, 1) for all n > h + k + s, as 
required. I 

Proof of Theorem d Suppose that iV is not a multiple of m, and let g — a o f with 
a G Sjv- By Corollary 14.41 there is no proper subset A with §g(A) = (jA. Hence by 
Lemma F4.5[ (? is mixing. I 

We now suppose that the N = ml for some integer £ > 1. 
Proposition 4.6 There exists a permutation 5 G S^ swc/i #m£ 

f(Ij)Dl siJ) foraUjeZ/NZ. (24) 

For any suc/i 5 7 and any A C Z/iVZ, tae following are equivalent: 
(i) $(A) = %A; 

(ii) A is a union of cosets of the subgroup £Z/NZ of Z/iVZ; 
(Hi) o-5(A) = ~g(A). 

Proof. To prove the first assertion, we exhibit a permutation S with the required 
property. For < i < N, write i = j + c£ with < c < m and < j < £, and set 
8(i) = mj + c. It is routine to verify that 5 G Sjy, and, as 

TO— 1 



/CO) - U Irnj+d, 



d=0 
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the condition (j2l]) holds. 

Now fix a choice of 5 G SV satisfying (12"4"]1 . Since |jg(A) = (1/(^4), the equivalence of 
(i) and (ii) follows from Proposition 14.31 Since o~5 G Sn, it is immediate that (iii)=^(i). 
It remains to show that (ii)=>(iii). 

Since f(Ij) = f(Ij+£) for each j, it follows from ([24]) that 5 takes the m elements 
j ' + ci, < c < m to the m elements mj + d, < d < m in some order. Thus, if (ii) 
holds, 5 takes each coset a + £Z/iVZ contained in A to /({a}). Thus 5(A) = f(A), and 
applying a gives (iii). I 

Proof of Theorem Let a5 G Gb for some nontrivial, ^-stable block decomposition 
B, and let A be a block of B. Then A is a proper subset of Z/iVZ which is a union of 
cosets of £Z/iVZ. Thus #(A) = <7<5(y4) by Proposition 14. 6[ and this set is also a block of 
B. Inductively, we then have g r (A) = (a5) r (A), and hence $g r (A) = $(a5) r (A) = $A, 
for all r > 0. It then follows from Lemma 14.51 that g is non-mixing. 

Conversely, suppose that g is non-mixing. By Lemma I4.5[ there is a proper subset 
A of Z/iVZ such that §g r (A) = %A for all r > 0. By Proposition 14.61 and induction, 
g r (A) = (a5) r (A) for all r > 0. Moreover, each (a5) r (A) is a union of cosets of £Z/iVZ. 
Since cr5 is a permutation, it follows that (o~5) s (A c ) is also a union of cosets for each 
s > 0, where A c is the complement of A. Let B be set of all intersections of the sets 
(o-5) r (A), (o-5) s (A c ) for r, s > 0. Thus B is a collection of subsets of Z/iVZ, each of 
which is a union of cosets of £Z/iVZ. Let B be the collection of minimal nonempty 
sets in B. Then B is an ^-stable block decomposition and aS G G-&. Moreover, B is 
nontrivial since A is a union of blocks of B. I 



Remark 4.7 A similar argument shows that foa is non- mixing if and only if 5a G Gb 
for some nontrivial ^-stable block decomposition. 

5 The Proportion of Non-mixing Permutations 

We continue to assume iV = ml. We shall investigate the proportion p(£, m) of 
permutations o G S^ for which g = a o f is not mixing. By Theorem 121 this is the 
proportion of permutations such that 5a is in the stabiliser of at least one nontrivial 
^-stable block decomposition. For small values of £, we can use the Inclusion- Exclusion 
Principle (see e.g. [HI p. 21]) to count the number of such permutations directly. We 
will also give the precise asymptotic behaviour of p(£, m) as m — > oo with £ fixed. In 
the next section, we give an asymptotic upper bound for p(£, m) as £ — > oo with m 
fixed, thereby proving Theorem [3] 

The stabiliser of any nontrivial ^-stable block decomposition contains the subgroup 
H = S m x ... x S m of order (ml) which permutes the m elements of each coset amongst 
themselves. It will be convenient to calculate the index of H in the various stabilisers 
we consider, and to express these in terms of multinomial coefficients. For integers 
7*1, . . . , Tj > 0, the corresponding multinomial coefficient is 

' s \ sl 

r x ,...,rjj n\...rA 
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where s — r\ + ■ ■ ■ + Tj. This counts the number of ways of putting s labeled objects 
into j labeled boxes, so that, for 1 < % < j, box i contains rj objects. When j = 2, the 
multinomial coefficient reduces to the more familiar binomial coefficient. 

In order to refer to specific block decompositions, we let Ci,...,Ce denote the 
cosets of £7j/N7j in Z/iVZ (in some order). Giving an ^-stable block decomposition 
amounts to giving a partition of {C\, . . . , Ce}, and we denote the block decomposition 
by the corresponding partition of the set of indices {1, . . . , £}. Thus {1, . . . ,£— 1}, {£} 
represents the ^-stable block decomposition consisting of the two blocks C\ U . . . U Ce-\ 
of size [I — l)m and Ce of size m. 

5.1 £ = l 

The only ^-stable block decomposition is the trivial one, so g = a o / is mixing for all 
a, and p(l,m) = 1. 

5.2 £ = 2 

There is only one nontrivial ^-stable block decomposition. This has two blocks, each of 
size m. Its stabiliser contains H and also contains elements swapping the two blocks, 
so has order 1%H . Thus 

%H (2m -I 

In particular, taking m = 2, we get p(2, 2) = 1/3. Thus, when the doubling map 
f(x) = 2x mod 1 is composed with permutations a of the 4 equal subintervals of [0, 1), 
those o G S*4 for which / o a is not mixing form a single coset of a subgroup of index 3 
in S4. (Any such subgroup is dihedral of order 8.) 

5.3 £ = 3 

There are 4 nontrivial ^-stable block decompositions: 
« {1,2}, {3}; 
(ii) {1,3}, {2}; 
(iii) {2,3}, {1}; 

H {1}, {2}, {3}; 

We shall apply the Inclusion-Exclusion Principle directly. 

The stabiliser of any one of the block decompositions (i), (ii), (iii) has order 
{2m)\m\ = (^)tjif since it contains any permutation of the 2m elements in the block 
consisting of 2 cosets. The stabiliser of the block decomposition (iv) has order 6JjiJ 
since we may permute the 3 blocks amongst themselves in 3! = 6 ways. 

We now consider the stabilisers of any of the ( 2 ) = 6 pairs of the block 
decompositions. First consider the 3 pairs consisting of any two of (i), (ii) or (iii). 

15 



Any permutation fixing such a pair must fix each coset, so the stabiliser of any of these 
3 pairs is just H . A permutation stabilising (say) (i) and (iv) could also swap the 
cosets C\ and C2, so the stabilisers of the other 3 pairs have orders 2§H. The stabiliser 
of any 3 (or all 4) block decompositions is again just H . Thus the precise number of 
permutations in S^m fixing at least one of the block decompositions is 

3 ( J + 6 - 3 ~ 3x 2 + (0) _1 ) $ H = 3 ( ')tH = Z(2m)\m\. 



Hence 



1 /3m — 1 

P(3 ' m)= (3^)! X3(2m)!m!= l 2m 



In particular, p(3,2) = 1/5. 

5.4 i = 4 

For the sake of brevity, we only sketch the argument in this case. There are 14 nontrivial 
^-stable block decompositions, but we can eliminate half of them before applying the 
Inclusion-Exclusion Principle. The stabiliser of the unique block decomposition with 
block sizes 1, 1, 1, 1 contains H as a normal subgroup, the quotient being the group S4 
of all permutations of the 4 cosets. Let 7r G S4 be the image of some a in the stabiliser. 
Then either 71 or 7r 2 has more than one orbit on {1, 2, 3, 4}, and we can use this to find 
another block decomposition stabilised by a. Also, any permutation in the stabiliser of 
a block decomposition with block sizes 2, 1, 1, say {1,2}, {3}, {4}, also stabilises the 
block decomposition {1, 2}, {3, 4}. This means we only need to consider the stabilisers 
of the 4 block decompositions with block sizes 3, 1 and the 3 block decompositions 
with block sizes 2, 2. 

We can then apply to the Inclusion- Exclusion Principle to the stabilisers of these 7 
block decompositions, by considering all possible pairs, and, for each pair, considering 
any ways of extending the pair to a larger subset of the blocks with stabiliser larger 
than H . In doing so, we take care not to include any higher order stabilisers which 
have already been counted. (This means that the order in which we consider the pairs 
is important.) We omit the details of this calculation. After some simplification, we 
obtain the formula 



p(4,m) 
In particular, we find 



3m \ /2m\ 

'61 ) -12 

m 



/2m\ 
\m J 



M) 4 



(4m)!' 



1 37 

p(4,2) = -, p(4,3) 



5' " v ' ' 1540 

Note that, in contrast to the cases I = 2 and <L = 3, p(4, m) is not in general the 
reciprocal of an integer. 



16 



5.5 Arbitrary d 

For larger values of £, obtaining an explicit formula for p(£, m) by the Inclusion- 
Exclusion Principle becomes impractical, but we can at least determine the asymptotic 
behaviour of the dominant term for large m when £ is fixed. As this does not contribute 
to our goal of investigating the effect of composing multiplication by a fixed m with 
permutations, we will content ourselves with a brief sketch of the argument. 

Fix £ > 2. Arguing as above, we find from the Inclusion- Exclusion Principle that 
p(£, m) is a sum of terms of the form 

, (rim) ! . . . (fj-m) ! rl ( £m 



[£m)\ vim, . . . r^m) 

where ri + • • • + r,- = £ with j > 2 and 1 < ri < . . . < rj, and where C is a (positive or 
negative) integer depending on £ and the rj. One such term (corresponding to ^-stable 
block decompositions with 2 blocks of sizes (£ — l)m and m) has r x = 1, r 2 = £ — 1 and 
C = £. For £ > 3, there are £ such block decompositions each with stabiliser of order 
((£ — l)m)!m!, while for £ = 2 there is only one block decomposition but its stabiliser 
has order 2(m!) 2 . Thus we have C = £ in both cases. 
Applying Stirling's formula 



\j2stvn ( — ) asn-> oo, 



n\ ~ V /7m 



see for example pjJ, P- 47]) to each factorial in (j2"3"l) separately, we find that the term 
251) is asymptotic to 

Drnp-Wp™ 



as m -^ oo, where 



and 



D = C(2tt 



,0-l)/2, / r l--- r 7 



P 



n 

«=i 



'/'; 



Note that < p < 1. Moreover, one can verify that p is maximal for the term with 
j — 2, n = 1, r-i — £ — 1. (First show that splitting some r^ as r^ = r' + r" with r', 
r" > 1 decreases p, so that the maximum must occur when j = 2. Then show that, 
amongst terms with j = 2, p is maximum at r\ = 1 by differentiating the logarithm of 
x x (£ — x) £_x .) This term therefore dominates the asymptotic behaviour of p(£, m) as 
m —7- oo. 

This gives us the following result: 

Lemma 5.6 For fixed £ > 2, we have 



p(£, m) ~ y/27i(£-l)£m (*-^ 



m 

as m — > oo. 
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6 Asymptotic behaviour as £ — > oo 

Our goal in this section is to use combinatorial arguments to complete the proof of 
Theorem [3j 

From Theorem [2] we have 



^■'">^£«G 



(26) 



where the sum is over all nontrivial ^-stable block decompositions B. Given integers 
1 < T\ < . . . < Tj with ri + • • • + Vj = £, we consider the contribution to (12 6 p from 
all block decompositions B with block sizes mri, . . . ,mrj. The number of such block 
decompositions can be given in terms of a multinomial coefficient. Let us set 



77. 



(r 1 ,...,r j ) = ${h :r h = i} 



and 



d(n,...,rj) = Y[m(r !,..., rj)\. 



i=l 



Then the number of ^-stable block decompositions B of {1, . . . ,m£} with block sizes 

mr\ , . . . , mvj is 

1 



d(r 1 ,...,r j ) \r 1 ,...,r j/ 

Moreover, any such B is preserved by a group of permutations S mn x • • • x S mrj 
permuting the elements within each block, but we can also permute the blocks of any 
given size amongst themselves. Thus we have 



(JG B = d{r 1 , ...,rj) J~[{mr h )\ . 



\h=l 



The contribution to (|2T)j) from block decompositions with block sizes mr% 
therefore 



, . . . , mrj is 



(ml)\ 
which simplifies to 



d(r x ,...,rj) Y\{mr h )\ 



v/l=l 



r%i ■ ■ ■ ,Tj/ \mri 



d(r 1 ,...,r j ) \r 1 ,...,r j 



m£ 



L ,..., mrj 



-i 



Thus we may rewrite (1261) as 



p(4m)<£>;(4 

J=2 



where 



l<r!<...<rj v J/ v 



777^ 

Tnrx , . . . , mr^ 



ri+...+r,- 



The definition of 6j(^) makes sense for j — 1, giving bi(£) = 1. 



(27) 



Proposition 6.1 For 2 < j < £, we have 

r=l 

Proof. Separating out r\ in the definition of bj(£), we may write 

wn / n X / B X -1 



^fQ{^\-^-r). 



rT^i i<, .^< rj . vi > • • • . r jJ \ mr i ,---,mrJ 

T2-\ \-rj=£-r-i 

(Note that we have "<" rather than "=" since the condition r 2 > r\ has been weakened 
to r 2 > 1.) The result then follows on using the (easily verified) identity 

£ \ = fi\f t-n 

together with the corresponding identity where all the arguments are multiplied by m. 



Lemma 6.2 Suppose that m > 2 and £ > 3. Then, for 1 < j < £, we have 

h < (j) ■ (28) 

Proof. We argue by induction on j. The result holds for j = 1 since b\{£) = 1. 
Suppose that 2 < j < £ and the result holds for j — 1. From Proposition 16. 1[ we have 

M^OQ'V^-D + IOO-V^-O. (29) 

For the first term, we have the estimate 

£\fmt\-\ ( „ , £(m\) ( 2e V™" 1 ^) 



lj\mj J v ' ~ (m£)(m£ - 1) . . . (m£ - £ + 1) \£ - 1 

(m-1)! (2e £ \ ^-M" 2 -' 



< 



1/1 £ 2\ m_1 (2e o \( m -^- 2 ) 



2\2 £-1 £J V £ £-1 
< 



I /2e x (m_1)(i_2) / » x (^-iJO'-i) 



But 



(m-l)O'-l) / , v (m-l)(<-l) 

< I -— < e m ~ l 



1 / ~ \£- 1 
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since (1 + -) n is an increasing function of n and (1 + -) n — > e as n — V oo. As m > 2, 
it follows that 

We now consider each term in the sum in (1291) . For 2 < r < \i/j\ , we have 

\ m < (ml 
r ) ~ \mr 

(This is obvious combinatorially: some of the ways of choosing mr objects from ml 
are given by choosing r objects from the first £, then another r from the second I, and 
so on.) Also, since 2 < r < 1/2, we have 



< 

2j ~ \r 

Thus 

'A (ml\- ^ (l\ l - m , fl\ l - m 



K r)\mr) ~ \rj ~ \2J l m - x {l - 1)^-1 ' 

From the induction hypothesis, we have 

2e v (m-l)(j-2) / 2e \ ("»-l)0'-2) / • \ (m-l)(j-2) 



and (j7(j - l))^ 2 < (j/(j - I))'- 1 < e. Thus 

v^/A/mfV 1 ^ 2 m ~ 1 /2e\ (m ~ 1)(i ~ 2) 

2e \ (m-l)O-l) 



e m-l 



j(£_ 1 )m-l ^£ 

But as j > 2, m > 2 and £ > 3, we have 



j^-l)™- 1 ~ 2(^-1) ~ 4' 
Substituting the last estimate and (1311 into (T2"9~j) . we therefore obtain 



b s < 



2gX (m-l)O'-l) 



which completes the induction. 
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Proof of Theorem^ Since we assuming £ > 6, we have 2e/£ < 1. It then follows from 
(1271) and Lemma El that 



p(£,m) < J2[j) 



x /2e\ (m ~ 1)(j ~ 1) 

3=2 



2e 



m—l 



2e 



m—l 



-1 



As m > 2 and 2e/£ < ^|, this gives 



p(£,m) < 



, - . * 



-1 /r, \ m—l 

2e 



-^ <11 ^ 



2e 



(m-l) 



as required. 



7 Mixing for subshifts 

In this section, we give an example to show that the analogue of Theorem [3] no longer 
holds if we replace the multiplication-by-m map / with another Markov map on the 
unit interval. We consider the piecewise continuous function / : [0, 1) — > [0, 1) given 
by 

' 2x 



m 



if < x < \, 
x-\ if | < x < 1. 



(31) 



Fix £ > 1 and divide [0, 1) into N = 2£ equal sub intervals 

J J + l' 



4 



2V 2£ 



0<j<2£-l. 



For a permutation a G S^ of these subintervals, let g = a o f. We have the following 
result. 

Theorem 6 The proportion of permutations o for "which g is non-mixing (and indeed, 
non-ergodic) is bounded away from as £ — >■ oo. 

Proof. For any subset AC{0,...,2M}, define g(A) C {0, . . . , 2£ - 1} by 
£f(A) = {cr(2j), < r(2i + l) : j E A, j < £} U {a(j - £) :jeA,j>£}. 
Then, analogously to Proposition 14.2} we have 

»(lJ'.)= U '>■ 

\a£A / beg(A) 
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Note however that Proposition 14.31 no longer holds: for example, if A = {0,£, £ + 1} 
(with £ > 2) then g(A) = {0, 1} has fewer elements than A. 

Now if there is some nonempty subset A such that g r (A) ^ {0, . . . , 2£ — 1} for 
all r > (respectively, such that [j r>0 g r (A) ^ {0, • • • , 2£ — 1}) then g is non-mixing 
(respectively, non-ergodic). But if a has the property that a(j — £) = j for some j > £ 
then, taking A = {j}, we have g r (A) = A for all r. Thus g is both non-mixing and 
non-ergodic. We therefore need to investigate the proportion of permutations with the 
above property. 

Let 1 < m < £ and let S be a subset oi {£,... ,2£ — l} oi size m. There are (2£ — m)\ 
permutations a G Sit such that a (J — £) = j for all j G S. Moreover, the number of 
such sets S of size m is (J . Thus, by the Inclusion- Exclusion Principle, the proportion 
of permutations a G S 2 £ with a(j — £) = j for at least one j > £ is 



m=l 

where 

(£\(2£-m)\ 

ttm ~ \m) (2£)\ ■ 
Now the terms in the alternating series are decreasing: for 1 < m < £ we have 

a m+1 m\(£ - m)\(2£ - m - 1)\ £-m 1 



a m (m + 1)\(£ - m - l)\(2£ - m)\ (m + l)(2£-m) 2(m + 1) ' 

Hence the required proportion is bounded below by 

1 £{£-!) 3 
ai_a2 ~2 _ 2(2£)(2£-l) > 8' 

So we have proved that, for each £ > 1, the function a o / is non- mixing (indeed, 
non-ergodic) for more than 3/8 of the permutations a G S^. I 

Although the map / in (13T|) is not uniformly expanding, a finite power of / is. Thus 
we can still apply the results of Section [3] to study the mixing rates. The associated 
Fredholm and transition matrices are given (respectively.) by: 

and thus the invariant density is given by p{x) = \X^{x) + ^X^){x), (see (fT4"|) ). Using 
this information we get the following results. 

1- r ess = e" 5 where £ = f Q (log f'(x))p(x)dx = flog 2. 

2. The topological entropy is given by log( 1+ 2 ^ 5 ) < r ess . 

3. det(J — $(2)) = implies that z = 1, —2. Thus 1 is the only eigenvalue of £/ 
outside |z| > r ess . 
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It remains an open problem to determine the corresponding quantities for a o f with 
a £ Sn, N > 1. To investigate the rate of mixing of a of (relative to / alone), we would 
also need to calculate the invariant density associated to each a £ Sn (since Lebesgue 
measure is not invariant). This would allow us to compute the corresponding value 
of r ess := r ess (a). The isolated eigenvalues of £0-03 would be found via the associated 
Fredholm matrices. 



8 Proof of Theorem 3 



In this Section we consider f(x) = mx mod 1 and o G Sn- We will exhibit 
permutation transformations for which a o f mixes arbitrarily slowly in the sense of 
( TB?|) . The proof requires a detailed study of the eigenvalues of the Fredholm matrices 
$(z) associated to a o /. These matrices are attached to a partition of [0, 1] on which 
er o / is Markov, so we first need to determine such a partition. 
For k > 1, consider the partition 

V k :={[(j-l)/k,j/k) : l<j<k} 

of [0, 1) into k equal subintervals. Then the map / is Markov w.r.t. V m , while the map 
a is Markov w.r.t. Vn- The map a o f, however, is in general not Markov w.r.t. either 
of these partitions. For example consider m = 2, N = 3. Clearly any cr £ S3 is Markov 
on the partition 

V 3 = {[0,1/3), [1/3, 2/3), [2/3,1)}. 

However, if we take the permutation a interchanging the last two subintervals, then 

we have 

(2x if <x< 1/6, 

o o fix) = < 

[2a; + 1/3 if 1/6 < x < 1/3, 

so that a o f is not continuous on [0, 1/3] and hence not Markov on P. In general, to 
ensure that a o / is Markov for all a £ Sn, we must work with the partition V^m- 

Due to our specific choice f(x) = mx mod 1, the Nm x Nm matrix $(1) is 
precisely the probability-transition matrix between the Markov states, and has all 
its entries in {0, 1/ra}. If A £ Spec(£/|Bv) then we know that z = 1/A is a solution 
to D(z) = det(J — *&(z)) = 0. It is therefore an equivalent problem to consider the 
corresponding equation (in A) to det(B — XI)— 0, where B is the state transition matrix 
(with entries in {0, 1}). Hence if A is an eigenvalue of B, then A = X/m £ Spec(£/JBv)- 

Note that in our case, m$(l) is precisely the state transition matrix B. We will 
show that the eigenvalues of $(1) can in fact be determined from the N x N transition 
matrix associated with the partition Vn (on which a o f need not be Markov). 

We must first define some notation. Following the conventions of Section [2J we 
index the subintervals in Vk by {0, 1, . . . , k — 1}. We therefore begin the numbering 
of the rows and columns in the associated matrices from 0. We define A(m, N) and 
B(m,N) to be the state transition matrices for / w.r.t. Vn and Vntu respectively. 
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Thus for < i, j < N — 1 we have 

A(m, N)tj 

and for < i, j < Nm — 1 we have 

B(m,N)ij -- 



1 if j '• = mi + d mod N with < d < m — 1, 
otherwise, 



1 if j ; = mi + d mod Nm with < d < m — 1, 
otherwise. 



For example, when m = 1 and TV = 3, we have 



4(2,3) 




5(2,3) 



(I 1 





\ 





1 1 











1 1 


1 1 











1 1 





\ 





1 1/ 



The eigenvalues for .4(2,3) are {±1,2}, while those for 5(2,3) are {±1,2,0}, where 
the eigenspace for the eigenvalue has dimension 3. In the case m = 3, N = 5 we have: 



4(3,5) 



/l 1 1 0\ 
10 11 
1110 
110 1 

\0 1 1 1/ 



and the eigenvalues for 4(3,5) are {3,±i,±l}. Note that all row sums and columns 
sums in both A(m, N) and B(m, N) are m. Each row in either matrix consists of m 
consecutive occurrences of 1 (where, in the case of A(m, N) these may "wrap around" 
from the last column to the first). The rows of B(m, N) naturally fall into m identical 
blocks each consisting of N rows, and the columns into iV blocks each consisting of m 
identical columns, as indicated for 5(2,3) above. 

The corresponding state transition matrices for a o f are obtained by permuting 
the columns of A(m,N) and B(m,N). More precisely, given a permutation a of 
{0, . . . , N — 1}, let P((t) be the N x N permutation matrix given by 



P(a 



if J 



o U 



K) 



otherwise, 



and let Q(cr) be the Nm x Nm matrix obtained by replacing each entry 1 (respectively, 
0) in P(cr) by an m x m identity matrix (respectively, zero matrix). Then the state 
transition matrices for a o / w.r.t. the partitions Vn and V^ m are A(m, N)P(a) and 
B(m, N)Q(a) respectively. For example, if m = 2, iV = 3 and a is the 3-cycle (0, 1, 2) 
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then 



P( 



<J 




Q{o) 



/ 




1 
1 


\ 












1 
1 


1 
\0 1 







0/ 



so that 



A(2,3)P(<r) 



'0 1 r 

1 1 0) , B(2,3)Q(a) 
1 1 



/ 




1 1 




\ 

1 1 


1 1 






1 1 








\1 1 






1 1 

0/ 



Note also that P(a)A(m,N) is the matrix obtained by applying the inverse 
permutation a -1 to the rows of A(m, N). 

To determine the mixing rate of a o /, we need to investigate the eigenvalues of 
the Fredholm matrix $(1) = m~ 1 B{m,N)Q(a) corresponding to the partition Vntu 
on which a o f is Markov. Clearly A is an eigenvalue of $(1) if and only if mX is an 
eigenvalue of B(m, N)Q(o~), so it suffices to find the eigenvalues of the latter Nm x Nm 
matrix. In fact we only need consider N x N matrices. 

Lemma 8.1 For all m, N and all o G SV, the nonzero eigenvalues of B(m, N)Q(a) 

are the same as those of A(m,N)P(a). 

Proof. For brevity, we write A = A(m, N), B = B(m, N), P = P(o~) and Q = Q(cr). 

We view BQ as determining a linear endomorphism 9 on the space V = <C Nm of 
column vectors. Clearly BQ has rank N, since the first N rows are linearly independent 
and the remaining rows merely repeat these. The kernel W of therefore has dimension 
N(m — 1), and induces an endomorphism on the quotient space V/W of dimension 
N. The eigenvalues of (that is, of BQ) are therefore the eigenvalues of 0, together 
with the eigenvalue of multiplicity N(m—1) coming from W. The result will therefore 
follow if we show that the matrix AP represents 0. 

We define vectors v r ' s for 0<r<N — 1, 0<s<m— 1 (independent of a) as 
follows. For s = 0, set 

1 if % = mr, 
otherwise, 



r,0 

v„- 



and for s > 0, 



—1 if % = mr, 

I if i = mr + s, 
otherwise. 
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For example, if m — 2 and iV = 3 we have 



(I) 







V J 


, v ^ 






V o J 


, v 1 -° = 


(I) 

1 





V J 


, v 1 -^ 


f I) 

-1 
1 


I J 


, v 2 -° = 


(I) 




I I J 


, v*- 1 = 


( °\ 






1 
V -i / 



where the horizontal lines correspond to the division of the columns of B(2,3)Q(o~) 
into blocks. 

It is clear that the v r ' s form a basis for V, and that if s ^ then BQv r,s = 0. 
Hence the N(m — 1) vectors v r ' s for s ^ form a basis for W. Thus the iV cosets 
v r '° + W form a basis for V/W. If we partition BQ into mxm blocks (as in the above 
example), the matrix of 9 with respect to this basis is then obtained by replacing each 
block with the sum of one of its (identical) columns. This gives precisely the matrix 
AP. I 



We next consider a matrix related to A(m, N) but with eigenvalues that are easy 
to determine. Let C(m, N) be the N x N circulant matrix with entries 



C(m, N)ij = 
for < i, j < N — 1. For example 



1 if j = % + r mod iV with < r < m — 1, 
otherwise 



C(2,5) 



/l 1 0\ 

110 

110 

11 

\1 1/ 



Proposition 8.2 The matrix C(m, N) has eigenvalues 

m—l 



A 



E 



W.; 



t=o 



< j < N - 1, 



where Uj = e 2m ^ N . In particular, \ = m and, for 1 < j '< N — 1, 



|A,-| 



sin(mJ7r/A^) 



Moreover, 



det(C(m,N)) 



sm(J7r/N) 



if gcd(m,N) > 1; 

±m i/ gcd(m, JV) = 1. 



Proof. We set C = C(m, N). It is easily checked that 
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(32) 



(33) 



is an eigenvector of C with eigenvalue Xj. Although the Xj are not necessarily distinct, 
the N vectors Vj are linearly independent since 

det(ujj)o<j,k<N = Y\_( u k ~ u j) ^ °> 
so there are no further eigenvalues. Trivially Ao = m. For j ^ 0, we have 

e 2Trimj/N _ j 



A 



'J " e 2nij/N _ I 



Taking the modulus gives the required expression in equation (|33 
Moreover, we have 

it ,«, ttm i tt sm(mJ7r/N) 

|det(C)| = UlA.-HmJf 



3=0 3=1 



sm(J7r/N) 



If gcd(m, N) — s > 1 then, taking j = N/s, we have sm(mJ7r/N) = and hence 
det(C) = 0. 

If gcd(m, N) = 1 then the residues mj mod iV are just the residues j mod iV in 
some order, so the product reduces to 1. I 



The maximum \Xj\ for 1 < j < N — 1 in Proposition 18.21 is attained at j = 1. 
Although this is essentially elementary, it is trickier to verify than it might appear, so 
we include a proof. 



Proposition 8.3 



max 

1<3<N-1 



sin(mJ7r / 'N) 



sm(J7r/N) 



sin(m7r/iV) 



sin(7r/A^) 



Proof. Since | sin(7rA; ± x)\ = \ sinx| for all k G Z, we may assume that 1 < m < N/2, 
and moreover it suffices to take 1 < j < N/2. We consider the two functions 
u(x) = sinmx/sinx and v(x) = 1/sinx on the interval (0,7r). Now u(x) has pre- 
cisely m — 1 zeros on this interval, at x — hir/m for 1 < h < m — 1. Since u(x) 
may be written on as a polynomial of degree m — 1 in cosx, and cosx is monoton- 
ically decreasing on this interval, it follows that u(x) has precisely m — 2 stationary 
points, one in each of the intervals (hn/m, (h + l)7r/m) for 1 < h < m — 2. In partic- 
ular, as lim x ^ u(x) = m, it follows that u(x) is positive and decreasing on (0,7r/m), 
so that u{ix/N) > u(jir/N) > if 2 < j < N/m. On the other hand, as v(x) 
is positive and decreasing throughout (0,7r/2), we have for N/m < j < N/2 that 
\u(jn/N)\ < v(jir/N) < v(ir/m) < v(n/2m). But v(ir/2m) = u(n/2m) < u(ir/N) as 
m < N/2. Hence \u(jir/N)\ < u(n/N) for 2<j< N/2, as required. I 

We now seek to relate the eigenvalues of the matrices A(m, N)P(cr) to those of 
C(m,N). After a suitable scaling, these matrices are doubly stochastic. Our next 
result gives some information on the behaviour of the eigenvalues of a columm stochastic 
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matrix under permutation of its columns (or, more generally, under right multiplication 
by an orthogonal, column stochastic matrix). 

Recall that an N x N matrix is row (respectively, column) stochastic if its entries 
are nonnegative real numbers and the sum of each row (respectively, column) is 
1. It is doubly stochastic if it is both row and column stochastic. The product of 
two row (respectively, column, doubly) stochastic matrices is again row (respectively, 
column, doubly) stochastic. For A(m, N) as above, the probability transition matrices 
irT 1 A(m, N) are doubly stochastic. Any permutation matrix P(cr) is doubly stochastic 
and orthogonal. 

We view our matrices as linear maps on the space C^ of column vectors, endowed 
with the usual complex inner product (x, y) = X]j=i x i^j f° r x = ( x i; • • • i x n) T , 
y = (yi, . . . ,Vn) t , and write ||x|| = a/(x, x) for x G C N . Any row stochastic matrix 
has the obvious eigenvector e = (1, . . . , 1) T with eigenvalue 1. It is well-known that 
any eigenvalue A satisfies |A| < 1. If B is a column stochastic matrix then e is not 
necessarily an eigenvector for B, but if (x, e) = then (-Bx, e) = 0, so that B preserves 
the subspace V of vectors in C^ perpendicular to e. 

Lemma 8.4 Let B be an N x N column stochastic matrix. Then the eigenvalues of 
B T B on V are real and nonnegative. Let r\ be the largest of these, and let P be an 
N x N orthogonal, column stochastic matrix (e.g. a permutation matrix). Then every 
eigenvalue X of BP on V satisfies 

|A| < sfn- 

Moreover, if B is a circulant matrix then 

^fr\ = max{ |A| : X is an eigenvalue of B on Vo}. 

Proof. Since B T B is a real symmetric matrix, its eigenvalues are real. Moreover, for 
any x G C , we have (B T Bx, x) = (-Bx, Bx) > 0, so these eigenvalues are nonnegative. 
We have 

7] = max{ (B T Bx, x) : x G Vq, ||x|| = 1} = max{ (Bx, Bx) : x G Vo, ||x|| = 1}. 

, (34) 
Now let y G Vo be an eigenvector of BP, corresponding to the eigenvalue A, and 
normalised so that ||y|| = 1. Then 

|A| 2 = (Ay, Ay) = (BPy,BPy) = (Bz,Bz), 

where z = Py. But z G Vo since P is column stochastic, and ||z|| = 1 since P is 
orthogonal, so that |A| 2 < rj as claimed. 

Now suppose that B is also a circulant matrix. Let y^ = N~ 1 ^ 2 Vj, where the Vj are 
as in the proof of Proposition 18.21 Then the y^ for 1 < j < N — 1 form an orthonormal 
basis of eigenvectors for B on Vo. Let Xj be the eigenvalue for y^, and let k be an 
index such that \Xk\ = max^xTv-i |Aj|. For any x G Vo with ||x|| = 1, we may write 

x = J2f=i c jYj with E^Ti 1 M 2 = 1- Then 

AT-l 

(Bx,Bx) = Y, M 2 |\f < l^l 2 = (By k ,By k ), 
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so the maximum in (1341) is attained at x = y^, giving rj = \\k\ 2 - 



Remark 8.5 A similar argument is used in [5] to control the spectral gaps for a special 
class of doubly stochastic matrices. 

Proof of Theorem^ For Statement 1, see Section 13731 We now consider Statement 2. 

For a given a G Sn, we are interested in the eigenvalues of the matrix $(1) = 
m^ 1 B(m,N)Q(o'), since these the eigenvalues of $(1) (where &{z) is the Fredholm 
matrix of uo /) and therefore the isolated eigenvalues in Spec(£/|Bv)- By Lemma I5TTJ 
it suffices to consider the eigenvalues of m~ l A(m, N)P(a). 

Since we are assuming that gcd(m, N) = 1, the matrix C(jn, N) can be obtained 
from A(m, N) by applying some permutation r to the rows. Thus P(T~ 1 )A(m, N) = 
C(m,N). For any a G S N , the matrix rrT 1 A{m, N)P(a) = mr 1 P{r)C(m ) N)P{a) 
is conjugate to m^ 1 C{m 1 N)P{aT~ l ), so it suffices to consider the eigenvalues of 
the doubly stochastic matrices m~ x C{rn, N)P(a) for all u 6 Sjy. We exclude the 
eigenvalue 1 associated to the trivial eigenvector e, so consider only the eigenvalues on 
its orthogonal complement Vq. 

We apply Lemma [8.41 to the doubly stochastic circulant matrix B = m~ 1 C(m,N). 
For any a G Sn, and any eigenvalue A of BP(cr) on V , we have |A| < ■s/rj, where, by 
Propositions 18.21 and 18.3} 

sin(m7r/iV) 



msin(7r/iV) 



We finish by noting some further consequences of Proposition 18.21 

Proposition 8.6 Suppose that gcd(m,N) = 1. Then for any a G Sn, the matrix 
A(m, N)P(a) has eigenvalue m with (algebraic) multiplicity 1. All its other eigenvalues 
are algebraic integers of norm ±1. Composition with o preserves the mixing rate of f 
(that is, A a = 1/m in the notation of Section VJ.'J) if and only if these algebraic integers 
are roots of unity. 

Proof. Clearly the characteristic polynomial of A(m, N)P(a) has integer coefficients 
and has leading coefficient 1, i.e. its roots are algebraic integers. If A is any one of these 
eigenvalues, then its conjugates are also eigenvalues, and its norm (i.e. the product 
of its conjugates) must be a rational integer. Now the product of the eigenvalues 
is ±det(A(m,N)P(a)) = ±det(C(m, N)) det(P(rer)) = ±m since any permutation 
matrix has determinant ±1. We have the obvious eigenvalue m (with eigenvector e), 
so m has multiplicity 1 as a root of the characteristic polynomial, and all the other 
roots must have norm ±1. 

Now if all the eigenvalues A ^ m of A{m, N)P(a) are roots of unity, we have | A| = 1. 
Thus no element of Spec(£/|Bv) has modulus between m _1 and 1, and a o / has the 
same mixing rate as /. Conversely, suppose that a o f and / have the same mixing 
rate. Then we must have |A| < 1 for all eigenvalues A ^ m of A(m, N)P(cr). But then 
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all the conjugates A' of A are again eigenvalues, and hence satisfy |A'| < 1. In fact each 
| A' | = 1, since the product of the A' is ±1. Now any algebraic integer all of whose 
conjugates have modulus 1 must be a root of unity (see e.g. [3 IV, (4,5a)]). Hence all 
the eigenvalues A/mof A(m, N)P(a) are roots of unity. I 

From a purely algebraic point of interest, we make one further remark about the 
transition matrices. 

Remark 8.7 When gcd(m, N) > 1, the matrix A(m, N) has as an eigenvalue. Since 
we recover A(m,N) from B(m,N) by factoring out the full nullspace of B(m,N), 
it follows that B(m,N) is not diagonalisable; it is conjugate to a matrix in Jordan 
Canonical Form with at least one Jordan block of size > 1 for the eigenvalue 0. 
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